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ABSTRACT 

To investigate the fundamental principles of H2 formation in a giant molecular cloud, we 
derive the Hi and H2 surface density (Sri and Sh2) images of the Perseus molecular cloud on 
sub-pc scales (^0.4 pc). We use the far-infrared data from the Improved Reprocessing of the 
IRAS Survey and the y-band extinction image provided by the COMPLETE Survey to estimate 
the dust column density image of Perseus. In combination with the Hi data from the Galactic 
Arecibo L-band Feed Array Hi Survey and an estimate of the local dust-to-gas ratio, we then 
derive the Sh2 distribution across Perseus. We find a relatively uniform Ehi ~ G 8 M3 pc^^ for 
both dark and star-forming regions, suggesting a minimum Hi surface density required to shield 
H2 against photodissociation. As a result, a remarkably tight and consistent relation is found 
between Eh2/Shi and Shi + Sh2- The transition between the Hi- and H2-dominated regions 
occurs at iV(Hl) + 2A''(H2) ^ (8-14) x 10^" cm~^. Our findings are consistent with predictions 
for H2 formation in equilibrium, suggesting that turbulence may not be of primary importance 
for H2 formation. However, the importance of warm neutral medium for H2 shielding, internal 
radiation field, and the timescale of H2 formation still remain as open questions. We also compare 
H2 and CO distributions and estimate the fraction of "CO-dark" gas, /dg ^ 0.3. While significant 
spatial variations of /dg are found, we do not find a clear correlation with the mean F-band 
extinction. 



1. Introduction 

Most of the dense molecular gas in galaxies is assembled into giant molecular clouds (GMCs) with masses 
from 10'' M0 to 10^ Mq and radii from 50 pc to a few hundred pc (e.g., Fukui & Kawamura 2010). As stars, 
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the "atoms" of galaxies, are exclusively formed in these GMCs, physical processes that govern the conversion 
of Hi into H2 clearly play an important role in determining the properties of GMCs and consequently the 
initial conditions for star formation. Therefore, understanding the properties of the interstellar regions where 
gas transitions from Hl-H2-dominated (H1-H2 transition hereafter) is an important step toward a complete 
theory of star formation. 

From an observational point of view, the H1-H2 transition has been investigated in the Galaxy mainly 
via ultraviolet (UV) absorption measurements toward background sources. For example, Savage et al. (1977) 
and Gillmon et al. (2006) measured H2 column densities, A^(H2), along the lines of sight to early-type stars 
and AGNs using the Copernicus and FUSE satellites and combined them with Hi column densities, A''(Hi), 
to estimate H2 fraction, / = 27V(H2)/(iV(Hi) + 2Af(H2)). They found that / sharply rises at iV(H) = 
iV(Hl) + 2iV(H2) ~ (3-5) X 10^° cm~^. Several far-infrared (FIR) studies of interstellar clouds have revealed 
the presence of H2 based on the comparison between dust and atomic gas contents (e.g.. Reach et al. 1994; 
Meyerdierks & Heithausen 1996; Douglas & Taylor 2007). Reach et al. (1994) found a significant positive 
deviation from a linear relation between IR intensity at 100 /im and A'^(Hl) for cirrus clouds, indicating a 
substantial amount of H2. The threshold iV(Hl) at which A^(H2) implied by the FIR data becomes significant 
was found to be ~4 x 10^*^ cm~^, consistent with the results from Savage et al. (1977) and Gillmon et al. 
(2006). 

More recently, the H1-H2 transition has been observationally inferred by an absence of high A''(Hi). For 

example, Barriault et al. (2010) observed two high Galactic latitude clouds in the OH and Hi transitions and 
found that A^(Hl) saturates at ~5 x lO'^'^ cm~^ for high values of iV(OH), suggesting that there is molecular 
gas not traced by Hi. Extragalactic studies have found this saturation as well. Wong & Blitz (2002), Blitz 
& Rosolowsky (2004, 2006), Bigiel et al. (2008), and Wong et al. (2009) observed nearby galaxies in the CO 
and Hi transitions and found that Hi surface density, Shi, saturates at ^10 Mq pc^^. Most extragalactic 
studies have a spatial resolution of ~700 pc at best, where most GMCs are unresolved and several are Ukely 
to be blended along a line of sight. In addition, at this resolution it is impossible to compare H2 with the 
corresponding Hi associated with the cloud envelopes. 

From a theoretical point of view, the H1-H2 transition has been studied in the context of the structure 
of photodissociation regions (PDRs). As UV photons filling up the bulk of the interstellar medium (ISM) 
photodissociate H2, predominantly molecular gas is only found in the dense regions where high enough 
shielding is achieved by both dust grains and H2 self-shielding. These molecular regions are bounded by PDRs 
where the gas predominantly exists in the atomic phase (e.g., HoUenbach & Tielens 1997). The structure of 
PDRs has been mainly investigated for ID geometries that include uni-directional or bi-directional beams of 
radiation propagating into semi-infinite slabs or purely radial radiation fields hitting the surfaces of spherical 
clouds (e.g., van Dishoeck & Black 1986; Draine & Bertoldi 1996; Browning et al. 2003). However, this 
ID approximation is not appropriate for GMCs that are embedded in a radiation field produced by the 
combination of many single stars and star clusters. In addition, when GMCs are surrounded by large Hi 
envelopes, which have been frequently observed in the Galaxy (e.g., Wannier et al. 1983, 1991), the curvature 
of the PDRs and the finite size of the molecular regions axe no longer negligible. A higher dimensional 
approach is required in this case, but as it requires a treatment of the angular dependence of the radiation 
field, all previous studies have been purely numerical (e.g., Spaans & Neufeld 1997; Liszt & Lucas 2000; 
Liszt 2002). These numerical studies do not provide a simple analytic estimate of the structure of PDRs 
that allows us to extrapolate the models over a wide range of ISM environments and to easily compare with 
observations. 

Recently, Krumholz et al. (2008, 2009; KMT08 and KMT09 hereafter) performed analytic studies on 
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a PDR in a spherical cloud that is bathed in a uniform and isotropic interstellar radiation field (ISRF). 
They investigated the steady-state equilibrium formation and photodissociation of H2 and approximated the 
H1-H2 transition as being infinitely sharp. Their calculation of where exactly inside a molecular cloud this 
transition happens resulted in two important analytic predictions. First, they found that a certain amount 
of Ehi is required to provide enough shielding for H2 formation. For GMCs with solar metallicity, this is Ehi 
~ 10 Mq pc~^. Once the minimum Ehi is achieved to shield H2 against photodissociation, all excess Hi is 
converted into H2 and the H2 abundance increases. Second, they derived an analytic function for H2-to-Hi 
ratio, i?H2 = Eh2/Ehi, and showed that i?H2 depends on the total gas surface density and metallicity but is 
interestingly almost independent of the strength of the ISRF. This is a very powerful prediction, suggesting 
that for a given ISM environment, the total gas surface density determines the H2-to-Hi ratio. If confirmed 
by observations, this "easy" prescription of H2 formation will be invaluable for numerical simulations of 
galaxy formation and evolution. From a purely practical perspective, as the above predictions are functions 
of direct observables (total gas surface density and metallicity) , they are much easier to test observationally. 
Most of the previous models of II2 formation (e.g., van Dishoeck & Black 1986; Draine & Bertoldi 1996; 
Browning et al. 2003) require the intensity of the UV radiation field and the number density of H nuclei as 
input parameters, which cannot be easily determined by observations. 

KMT09's predictions specifically focus on the properties of GMCs on galactic scales (^kpc). Several 
recent studies investigated the Hi and H2 properties of nearby spiral and dwarf galaxies on kpc scales and 
found reasonable agreement with KMT09's prediction of i?H2 as a function of total gas surface density and 
metallicity (e.g., KMT09; Fumagalli et al. 2009, 2010). On the other hand, KMT09's simplified treatment 
of the thermal and chemical processes in PDRs may be less suited for studies of individual PDRs. For 
example, KMT09's model does not consider the temperature dependence of H2 formation rate coefficients, 
H2 dissociation by cosmic rays, or absorption of Lyman- Werner (LW) photons by molecules other than 
H2. Nevertheless, KMT09's model provides strong physical insights into the fundamental principles of H2 
formation. 

In this paper, we test KMT09's predictions on the Perseus molecular cloud (Perseus hereafter) on sub-pc 
scales (~0.4 pc) for the first time in an attempt to understand the fundamental principles of H2 formation 
in GMCs. Perseus is a nearby molecular cloud located at a distance ranging from 200 pc to 350 pc (Herbig 
& Jones 1983; Cernis 1990) and has roughly a projected angular size of 6° by 3° on the sky (based on CO 
emissiorQ. We adopt the distance to Perseus of 300 pc throughout this paper. Perseus is part of the large 
Taurus- Auriga-Perseus molecular cloud complex and lies below the Galactic plane (6 ^ —20°), resulting in 
slightly less confused Hi spectra compared to other Galactic GMCs. With Al ^ lO** Mq (Sancisi et al. 1974; 
Ungerechts & Thaddeus 1987), this is a low mass cloud with an intermediate level of star formation (Bally 
et al. 2008) . Figure [l] shows the y-band extinction image of Perseus with CO contours to emphasize the 
locations of several prominent dark and star-forming regions. IC348 (age = 0.7-12 Myr; Herbig 1998) and 
NGC 1333 (age < 1 Myr; Lada et al. 1996) are active star-forming regions, while B5, BIE, Bl, L1455, L1448, 
and L1451 are dark regions with low levels of star formation. A wealth of multi-wavelength observations exist 
for Perseus; we provide details in Section 2. One of the recent observational advances is the high resolution 
Hi data set of Perseus and its outskirts provided by the GALFA-Hi survey (Peek et al. 2011). This data set 
nicely matches the angular resolution of FIR images, ^4', allowing us to estimate _Rh2 across and around 
Perseus and to compare the results to KMT09's predictions. 

This paper is organized in the following way. We start with a description of the data sets used in this 
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study (Section 2). We then describe details of our methodology to derive iV(Hl) and iV(H2) images (Sections 
3 and 4). In Section 5, we examine the dust temperature image closely and discuss the implications for the 
strength of the radiation field across Perseus. In Section 6, we summarize KMT09's model and predictions. 
In Section 7, we compare our Shi and i?H2 images with KMT09's predictions. We then compare the derived 
H2 distribution with the existing CO observations and investigate the existence of the "CO-dark" H2 gas 
(Section 8). Finally, we discuss and summarize our conclusions (Sections 9 and 10). 



2. Data 
2.1. Hi 

We use Hi data from the Galactic Arecibo L-band Feed Array Hi Survey (GALFA-Hi). GALFA-Hi 
uses ALFA, a seven-beam array of receivers mounted at the focal plane of the 305-m Arecibo telescope, to 
map Hi emission in the Galaxy. Each of seven dual polarization beams has an effective beamsize of 3.9' x 
4.1' and a gain of 8.5-11 Jy K'^ (Peek et al. 2011). The GALFA-Hl spectrometer, GALSPECT, has a 
velocity resolution of 0.184 km s"^ (872 Hz) and covers -700 km s''^ < v < +700 km s"^ (7 MHz) in the 
Local Standard of Rest (LSR) fram^ 

GALFA-Hi adopts two different scanning modes, the drift and basketweaving modes. In the drift mode, 
the telescope is fixed at azimuth of 0° or 180° and the sky is allowed to drift by. In the basketweaving mode, 
on the other hand, the telescope is fixed at the meridian and is driven up and down in zenith angle over the 
chosen declination (Dec) range. For each day of observations, the starting point of the telescope is shifted in 
right ascension (RA) by ~12' such that the entire proposed region is covered. ALFALFA (Giovanelli et al. 
2005) and GALFACTS (Guram & Taylor 2009), two large area surveys with which GALEA -Hi commensally 
observes, use the drift and basketweaving mode, respectively. 

GALFA-Hi combines Hi data from many individual projects, both in solo and commensal modes. 
Reduction of Hi data is performed with the GALFA-Hi Standard Reduction (GSR) pipeline. Full details 
of GSR are presented in Peek et al. (2011), but here we briefly describe its most important steps. First, 
Hi spectra are corrected for the intermediate frequency (IF) gain by dividing them by a reference spectrum. 
This reference spectrum is obtained via the least-squares frequency switching (LSES) technique (Heiles 
2005). The LSES observation is performed at the beginning of each observing period and the obtained IF 
gain spectrum is used for the whole period's data. Second, the Hi spectra are searched for baseline ripples 
produced by reflections in the signal chain, the primary reflector, and the Gregorian dome. This initial 
calibration is done by examining the Hi data in the Fourier space and removing the Fourier components 
that correspond to the reflection patterns. Third, the gain calibration is applied to the Hi spectra. The 
relative gains of each beam over each day of observations are determined by finding the points where beam 
tracks intersect and comparing the Hi spectra of each beam. This gain calibration is most effective for the 
scans obtained in the basketweaving mode, where a large number of intersection points are produced. Lastly, 
the fully calibrated Hi spectra are gridded onto the image plane in Cartesian coordinates. The gridded Hi 
data are then calibrated for the first sidelobe and the final brightness temperature scale. The lower angular 
resolution Hi data from the Leiden-Argentine-Bonn (LAB) Survey (Kalberla et al. 2005) are used for the 
final brightness temperature calibration. 



■^In this paper, all velocities are quoted in the LSR frame. 
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The GALFA-Hl first data release (DRl) has been presented by Peek et al. (2011). The DRl data cover 
7520 deg^ of sky and have been produced using 3046 hours of data obtained from 12 individual projects. 
RMS noise in a 1 km s~^ channel ranges from 140 mK to 60 mK, with a median of 80 mK. The DRl data 
can be downloaded at https://purcell.ssl.berkeley.edu/. The Hi data used in this paper are part of DRl 
and have been obtained from T0GS2 (Turn-On GALFA Spectrometer2; PI: C. Heiles & S. Stanimirovic) , 
a2004 (Hi Survey of the Taurus Molecular Cloud; PI: P. Goldsmith), and a2174 (Hi Survey of the Perseus 
Molecular Cloud; PI: L. Knee). We combined the scans from these three projects and produced an Hi cube 
centered at (RA,Dec) = (03h29'"52",+30°34'l") in J2OOC0with a size of 14.8° x 9°. 

2.2. Infrared 

We use 60 fim and 100 /im images from the Improved Reprocessing of the IRAS Survey (IRIS) (Miville- 
Deschenes & Lagache 2005) . IRIS provides an improved zero point calibration and correction for the zodiacal 
light and striping in the images, as well as increased angular resolution compared to the original IRAS images. 
The angular resolution at 60 fj,m and 100 /im is 4' and 4.3', respectively, comparable to that of the GALFA- 
Hl data. The noise level in the IRIS images is approximately 0.07 MJy sr"-'^ and 0.48 MJy sr~^ at 60 /im 
and 100 /xm, respectively. 



2.3. CO 

We use the CO integrated intensity image provided by Dame et al. (2001). Dame et al. (2001) combined 
CO surveys of the Galactic plane and local molecular clouds and produced a composite survey of the entire 
Galaxy at an angular resolution of 8.4'. The CO data were obtained by the 1.2-m telescope at the Harvard- 
Smithsonian Center for Astrophysics (CfA). The spectra were sampled with an angular spacing of 7.5' and 
the final data cube has a uniform RMS noise of 0.25 K per a 0.65 km s~^ channel. To estimate CO integrated 
intensity, Iqo, Dame et al. (2001) integrated CO emission from v = —15 km s~^ to -1-15 km s~^. See Section 
2 of Dame et al. (2001) for details of the observations and analysis. 

While we use the CfA CO data for the most of our analysis, ^^CO(l-O) data provided by the COM- 
PLETE survey (Ridge et al. 2006a) are used in Section 7. As ^'^CO(l-O) is a better tracer of dense molecular 
gas than CO, we use the /i3co image with a size of 6.25° x 3° to set the boundaries of the dark and star- 
forming regions in Perseus. The ^^CO(l-O) data were obtained by the 14-m Five College Radio Astronomy 
Observatory (FCRAO) telescope at an angular resolution of 44". To study Perseus, Ridge et al. (2006a) 
integrated ^■^CO(l-O) emission from v = km s^^ to -|-20 km s^^ with a velocity resolution of 0.07 km s^^ 
and estimated an average RMS noise per channel of 0.17 K. 

2.4. Av 

We use the Ay image provided by the COMPLETE survey. This image was constructed from the Two 
Micron All Sky Survey (2MASS) Point Source Catalog using the NICER algorithm (Lombardi & Alves 2001) 
at an angular resolution of 5'. The NICER algorithm estimates reddening along a line of sight by comparing 



^In this paper, all coordinates are quoted in J2000. 
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the observed colors of background stars to the intrinsic colors (measured in a nearby control field, where the 
extinction is negligible) . Note that this method does not assume a grain-size distribution or dust emissivity 
and therefore provides the most reliable estimate of the total gas column density in a molecular cloud (e.g., 
Goodman et al. 2009; GPS09 hereafter). In this paper, however, we do not directly use the 2MASS Ay 
image to derive the total gas column density because it only covers the central region of Perseus, while we 
are mainly interested in the cloud outskirts. Instead, we use the 2MASS Ay image to calibrate our tiqo 
image derived from the IRIS data (see Section 4.4). 

2.5. Treatment of Angular Resolution 

In this paper, we combine the data sets described above in several ways. In each case, we first bring all 
input data sets to common angular resolution determined by the lowest angular resolution data set. This is 
done by performing convolution with an appropriate Gaussian kernel. For example, to determine the velocity 
range of Hi emission based on the correlation between A^(Hi) and the dust column density, we smooth the 
Hi data to the angular resolution of the 2MASS Ay image, 5' (Section 3.2). To derive 7V(H2), the IRIS and 
Hi data are convolved with a Gaussian kernel to have an angular resolution of 4.3'. Consequently, our final 
Ehi, Sh2, and i?H2 images have an angular resolution of 4.3' (Sections 4.5 and 7). 

3. Methodology: Steps in Deriving TV (Hi) 

3.1. General Properties of Hi Spectra 

The main component of the Hi emission in Perseus is centered at w = +4 ^ +8 km s^^. Compared to 
the molecular gas traced by CO, the peak of the Hi emission is offset by 2-3 km s~^. As an example, we 
compare the Hi and CO spectra at the position of IC348 in Figure [2] There are (at least) two components 
in the Hi spectrum and the stronger one peaks at f +5 km s^^. Interestingly, the location of the dip 
between the two Hi components corresponds to the CO peak at u ^ +8 km s^^. This dip is likely from Hi 
self-absorption (HISA) and indicates the presence of very cold neutral medium (spin temperature Tg < 50 
K; e.g., Gibson 2010). The correction for HISA requires a knowledge of the relative contributions from cold 
and warm gas to the Hi emission, which can be obtained via Hi absorption measurements. However, only 
two Hi absorption measurements exist for Perseus (e.g., Heiles & Troland 2003a). This is clearly too sparse 
to interpolate the relative contributions from cold and warm gas throughout the whole molecular cloud. 
Therefore, we do not apply the correction for HISA to A^(Hi) but we investigate the optical depth effect in 
Section 7.1.1 by applying the first order opacity correction. 

3.2. Determining the Velocity Range 

To determine the velocity range of Hi emission associated with Perseus, we investigate the correlation 
between iV(Hi) and the dust column density. The correlation between Hi emission and FIR emission has 
been well-established for high Galactic latitudes (e.g., Boulanger et al. 1996). We use the 2MASS Ay image 
as a tracer of the dust column density and smooth the Hi data to the angular resolution of the 2MASS Ay 
image. We fix the central velocity of Hi emission at u = -|-5 km s~^ based on Section 3.1 and integrate Hi 
emission using a velocity width Av increasing from 2 km to 80 km s"-*^ with 2 km s~^ intervals. For each 
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derived A^(Hl) image, we calculate the correlation coefficient between A^(Hl) and 2MASS Ay- Only data 
points with signal-to-noise ratio (SNR) > 5 in the 2MASS Ay image are used for this calculation. Figure [s] 
shows the correlation coefficient as a function of Av. Although the correlation coefficient is never very high, 
there is a clear peak at Av ^ 10 km s^^. The lack of strong correlation between Ay and A^(Hi) has been 
noticed before and is mostly ascribed to the presence of molecular gas (e.g., Reach et al. 1994; Dame et al. 
2001; Leroy et al. 2009). We take the correlation coefficient of 0.15 (half of the maximum) as a threshold 
and use the corresponding Av of 20 km s~^ to derive Af(Hi) under the assumption of optically thin Hi gaf|^ 
The derived TV (Hi) ranges from 3.5 x 10^" cm^^ to 1.3 x 10^^ cm^^, with a median of 7.9 x 10^° cm^^. 
We present the A^(Hi) image in Figure |4j 

Sancisi et al. (1974) and Imara & Blitz (2011) estimated 7V(Hl) for Perseus independently and their 
results are essentially consistent with ours. Sancisi et al. (1974) obtained Hi data for Perseus using the 
25-m Dwingeloo telescope and estimated 7V(Hi) by integrating Hi emission from v = —10 km s~^ to -1-20 
km s^^. In addition, they attempted to subtract the foreground and background Af(Hl) by comparing the 
Hi emission in the direction of Perseus to the Hi emission measured in a nearby field spatially offset by a 
few degrees. They found an average A^(Hl) associated with Perseus '-^ 5 x 10^° cm~^. This is close to our 
median A^(Hl). On the other hand, Imara & Blitz (2011) used Hi data from the LAB survey and estimated 
-/V(Hl) by integrating Hi emission from v = —8 km s^^ to +14 km s^^. To determine the velocity range for 
iV(Hl), they assumed that Hi spectra can be approximated with Gaussian functions and set Aw to be equal 
to four times the standard deviation of the Gaussian component associated with Perseus. In summary, both 
Sancisi et al. (1974) and Imara & Blitz (2011) derived iV(Hi) under the assumption of optically thin Hi gas 
and their velocity ranges for A^(Hi) are consistent with ours. 



4. Methodology: Steps in Deriving iV(H2) 

4.1. Removal of Point Sources from the IRIS Data 

Before further processing, we remove point sources from the 60 fim and 100 fj,m images. We identify 
compact point sources from the IRAS Point Source Catalog that have /go > 1 MJy sr~^ and remove them 
by applying a circular mask with a radius equal to twice the angular resolution of the 100 /xm image. These 



masks are shown as blank pixels in Figures ml m ITO) and 14 



4.2. Exclusion of Possible Contaminations 

As Perseus belongs to the Taurus- Auriga-Perseus molecular cloud complex and our data cover a large 
area on the sky (~15° x 9°), there is a concern of including molecular gas not associated with Perseus in 
our analysis. To investigate this, we plot /go and /loo as a function of A^(Hi) in Figure [s] As /6o/^(Hi) 
and /ioo/A^(Hi) are approximations of dust-to-gas ratio (DGR), an overall linear relation, implying a single 
DGR, is a reasonable expectation for a single molecular cloud. Figure [5] shows that the data points are 
segregated into two possible distinct features at 7V(Hi) ^ 6 x 10^° cm~^. The majority of the data points 
shows a global linear relation between A^(Hl) and FIR emission. These data points are shown in gray and 
we assume that they mostly trace Perseus. However, the data points in black have a slightly different slope. 



*For optically thin Hi gas, Af(Hi) is calculated by A''(Hi) = 1.823 X J T^dv (cm where is the brightness temperature. 
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We selected the black data points as having A^(Hi) < 6 x 10^" cm^^. In Figure [6j we show the spatial 
distribution of the black data points by overlaying them on the 100 /ini image. Clearly, they are all found at 
the southeast edge of the image in the direction of Taurus. Taurus is centered at (RA,Dec) = (04^30™, +28°) 
and has vqo = ^ +12 km (e.g., Narayanan et al. 2008). As a distinct molecular cloud, Taurus would 
have a different DGR. To avoid any possible contamination, we exclude (blank) the black data points from 
the IRIS images and further analysis. 

In addition to the above consideration, the warm dust ring that stands out in the dust temperature 
image (see Section 4.3) is an additional source of contamination and we exclude it from further analysis. 



4.3. Tdust and xioo 

For optically thin dust grains at an equilibrium temperature Tdust, the optical depth at 100 /im, rioo, 
is determined by: 

Iioo , -I ^ 

"^i™ ~ ~n(T \ — y ^ ' 

where /loo is the measured intensity at 100 /im and -B(Tdust, Aioo) is the intensity of a blackbody of tem- 
perature Tdust at 100 /im. To estimate tiqo, we first estimate Tdust using the ratio /eoZ-^ioo- Under the 
assumption that the mass absorption coefficient, has a wavelength dependence such as k.\ oc A^^, Tdust 
can be estimated from: 

^60 _ / Ago \~^^+^^ exp(fec/AioofcTdust) - 1 ^2) 
Aoo V^ioo/ exp(/ic/A6ofcTdust) - 1 ' 

where h is the Planck constant and fc is the Boltzmann constant. The emissivity spectral index, can vary 
between 1 (for amorphous carbon) and 2 (for metallic and crystalline dielectric material), depending on the 
grain composition and size. Throughout this paper, we adopt j3 — 2 based on the discussions in Schnee et 
al. (2005; SOS hereafter). To estimate Tdust, we smooth the 60 /im image to the angular resolution of the 
100 /im image. 

/go may include a significant contribution from stochastically heated very small grains (VSGs) that must 
be accounted for. To do this, we adopt the method used by S05. First, we smooth the 60 /im and 100 /im 
images to the angular resolution of the Tdust map derived by Schlegel et al. (1998; SFD98 hereafter). SFD98 
used the low resolution DIRBE 100 /im and 240 /im data to derive Tdust; the contribution from VSGs is 
expected to be negligible at these wavelengths (e.g., Li & Draine 2001). Second, we calculate Tdust based on 
100% of /loo and a lesser percentage of Jgo, assuming (5 = 2. The fraction of /go is chosen to minimize the 
average difference between our Tdust and SFD98's Tdust- We find that the best matching fraction is 22%, 
meaning that 78% of /go is due to VSGs. This result is consistent with SOS, who found 26%, but lower than 
the prediction of interstellar dust model for the solar neighborhood (^40%; Desert et al. 1990). 

We present the Tdust image in Figure [Tj Our Tdust image is generally consistent with the Tdust image 
derived by SOS, but shows systematically slightly lower Tdust- The difference between our Tiust and SOS's 

Tdust , Tdust Tdust 

^go5, has a median of —1 K. This discrepancy likely arises from the fact that SOS adopted 
a Tdust-dependent formula for /3 and used a different fitting algorithm. Their Tdust-dependent formula for /3 
considered the observed inverse relation between Tdust and /3 and had two free parameters (see Equation 2 
of SOS). To simultaneously fit these two free parameters, SOS used the IDL routine AMOEBA, which adopts 
the downhill method of Nelder & Meade (196S), while we estimated the best matching parameter using the 
least-squares method. 
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Tdust across Perseus ranges from ^^16 K to ^22 K. The lowest r,uist regions spatially correlate with the 
highest Ay, in agreement with the expectation for dense and cold molecular gas. Several localized regions 
with high Tdust stand out; two are the star-forming regions, IC348 and NGC1333, while the third is a ring-like 
feature located at (RA,Dec) = (03^39™30%-F32°) with a radius of ~56'. Ridge et al. (2006b) investigated 
this ring in detail using IRAS, MSX, 2MASS Ay, and CO data and concluded that the ring, the product 
of an Hll region driven by the B-type star HD 278942, is likely located behind IC348. However, they also 
argued that the ring is interacting with Perseus, based on the spatial correlation between 8 /zm emission 
and 2MASS Ay- The ring is visible in the 2MASS Ay image but at a much lower contrast, implying that 
the ring is not as significant column density feature as one might infer from the IRIS data. Therefore, we 
exclude it from further analysis by applying a circular mask of radius of 56'. This mask size is consistent 
with what Ridge et al. (2006b) used to separate the ring from the Perseus cloud component, -^50'. 

At this point, we refined the zero point of the Tiqo image by assuming that the dust column density traced 
by Tioo is proportional to A''(Hi) for the regions dominated by diffuse atomic gas. Under this assumption, 
Tioo would approach zero as A'^(Hl) approaches zero and any offset from this relation is likely due to the 
uncertainties in the zero points of the IRIS images. As a result of this exercise, we added 1.8 x 10"'* to the 
Tioo image. The observed offset in noo implies a small additional uncertainty in T^ust, which is of order of 
0.03-0.6 K. 



4.4. Av 

Once we estimate Tioo using the VSG-corrected Tjust) we convert Tioo to Ay by: 

Ay = Xnoo, (3) 

where X is the conversion factor that relates the dust column density to the extinction in the y-band. The 
conversion factor X = 720 is estimated by finding the best agreement between our derived Ay image and 
the 2MASS Ay image for the region where the two images overlap. Our Ay and 2MASS Ay have a 1:1 
relation but with an average scatter of ^0.5 mag. This scatter around the 1:1 relation is mostly attributed 
to the variation in Tjust along a line of sight (e.g., GPS09). 

We note that the bootstrapping of our Ay image derived from the IRIS data with the Ay image 

derived from the 2MASS extinction data is motivated by GPS09. For Perseus, GPS09 inter-compared three 
methods for measuring total gas column density; extinction mapping at near-infrared (NIR) wavelengths, 
thermal emission mapping at FIR wavelengths, and CO emission mapping. They found that the dust-based 
measures (NIR extinction and FIR emission) are superior to the gas-based measure (CO emission) and 
particularly suggested the NIR extinction as the best probe of total gas column density. Any molecular 
transition, such as CO emission, traces a limited range of volume densities due to the critical density for 
excitation, opacity, and depletion. The dust-based measures have a wider dynamic range, but the NIR 
extinction measure saturates at high extinctions where background sources cannot be observed and the FIR 
emission measure is limited at low extinction values. In addition, the FIR emission measure suffers from 
its intrisic uncertainty, the variation of T^ust along a line of sight. Overall, the NIR extinction measure is 
the best probe up to its saturation level. Therefore, our calibration of noo to the 2MASS Ay image would 
provide a better estimate of the total gas column density, compared to the total gas column density derived 
solely from the IRIS data. 
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4.5. Dust-to-Gas Ratio and 7V(H2) 

Finally, we calculate A^(H2) by: 

where DGR is defined by DGR = ^y/7V(H) and iV(H) = iV(Hi) + 2iV(H2). 

To measure DGR in Perseus, we plot Ay as a function of iV(Hi) over the whole cloud in Figure |8] As 
we expect iV(H) ^ Af(Hi) for the regions dominated by diffuse atomic gas (which correspond to the majority 
of data points in Figure [sj, the fitted slope Av/N(Hi) = 1.1 x 10^^^ mag cm^ is a good measure of DGR. 
Our estimate is ~2 times higher than the typical Galactic DGR, 5.3 x 10~^^ mag cm^ (e.g., Bohlin et al. 
1978). This is not concerning as notable variations have been found for the Galactic DGR. For example, 
Kim & Martin (1996) compiled Ay/NCR) values for the Galaxy derived from UV absorption measurements 
from literature and found that Ay/N{}i) can be higher than the typical Galactic value by a factor of three 
(see their Figure 4). 

We also compare our DGR estimate with values obtained from two independent methods: 

(1) Copernicus/FUSE absorption measurements: Toward HD 22951, HD 23180, and BD+31°643, Coperni- 
cus/FUSE measured H2 directly in absorption (Savage et al. 1977; Rachford et al. 2002). HD 22951 and 
HD 23180 are Bl stars and BD+31°643 is a B5 star. Towards BD+31°643, Snow et al. (1994) measured 
Ay = 2.68 mag and combined this with the observed A^(Hl) and N(R2), resulting in Ay/N{E) = 5.5 x 10~^^ 
mag cm^. However, extinction parameters are not known for HD 22951 and HD 23180. Considering that 
HD 22951, HD 23180, and BD+31°643 are close to each other on the sky, if we apply Ry ~ 3.19 measured 
for BD+31°643 to both HD 22951 and HD 23180, we obtain Ay/N{ll) = (4.6-6) x lO'^^ mag cm^. While 
these estimates are close to the typical Galactic DGR, we note that they are likely lower limits on the DGR 
of Perseus, considering that most of the lines of sight surveyed by the Copernicus/FUSE are composed of two 
or more diffuse clouds (e.g., Rachford et al. 2002, 2009). Rachford et al. (2002) compared the observed high 
rotational transition column densities of H2 to the model of H2 formation, destruction, and ro-vibrational 
excitation developed by Browning et al. (2003) and found that it is impossible to reproduce the observed 
column densities under a single cloud assumption. Instead, models with multiple clouds along the line of 
sight, and/or multiple pathways of incident UV radiation, match much better the observed column densities. 
Stars preferentially form in regions with high DGR, where dust grains provide both high shielding against 
photodissociation and sites for H2 formation. Therefore, if several diffuse clouds exist along the line of sight 
toward Perseus, the component associated with Perseus likely has a higher DGR. In this case, the DGR 
measured along the whole line of sight is lower than the actual DGR of Perseus. 

(2) Upper limit on the DGR: Equation (4) itself provides an upper limit on the DGR. If the DGR is too 
high, a large number of data points will have a negative value of A^(H2). For example, if DGR > 1.5 x 10~^^ 
mag cm^, more than 50% of the pixels with Ay > 1 mag will have a negative value of A^(H2). Therefore, it 
is most likely that the DGR in Perseus is less than 1.5 x 10^^^ mag cm^. 

Conclusion: Based on two independent constraints on the DGR, we conclude that our estimate of Ay /A^ (Hi) = 
1.1 X 10~^^ mag cm^ is midway between the lower and upper limits and therefore represents a very reasonable 
estimate. 
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4.6. Uncertainty in iV(H2) 

We estimate the uncertainty in iV(H2) for each pixel by performing a series of Monte Carlo simulations. 
In these simulations, we take into account the uncertainties in A'^(Hi) and Ay and how they propagate into 
iV(H2). 

For the uncertainty in A^(Hl), we generate 1000 A'^(Hl) images by using 1000 velocity widths randomly 
drawn from a Gaussian distribution that peaks at 20 km s^^ with la of 4 km s~^ (based on Section 3.2). 
The median la of A^(Hl) due to the systematic uncertainty in using a particular velocity width is 5.6 x 10^^ 
cm~^. 

For the uncertainty in Ay, we assess the range of Ay by varying the input parameters and deriving 
Ay each time. We add/subtract la noise to/from the 60 /xm and 100 /xm images (0.07 MJy sr^^ at 60 /im 
and 0.48 MJy sr"-*^ at 100 /im) and use /3 = 1 or 2. Therefore, we derive the Ay images with 8 different 
combinations of the input parameters and for each case the contribution from VSGs to Jgo and X for 
Ay — Xtioq are determined by finding the best matching parameters. To include the uncertainties of the 
zero point, we also derive the rioo images with and without the zero point calibration. We find that the 
contribution from VSGs to Jgo varies from 0.12 to 0.24 with a median of 0.17, while X varies from 630 to 
810 with a median of 728. Finally, we compare the 16 Ay images and find the maximum and minimum 
estimates of Ay for each pixel. 

The uncertainty in A^(Hi) and the maximum/minimum estimates of Ay are incorporated in a Monte 
Carlo simulation to produce 1000 7V(H2) images. We use the distribution of the simulated iV(H2) to estimate 
the uncertainty in N{}i2) on a pixel-by-pixel basis. The median la of N{}i2) is 3.6 x 10^^ cm^^. 

In addition to the above considerations, several systematic effects are also likely to affect iV(H2); (1) the 
assumption of a single dust population and temperature along a line of sight, (2) the assumption of a single 
DGR, and (3) the assumption that II2 is the dominant source of excess dust emission. As it is difficult to 
estimate the magnitude of these effects, we do not include them in our Monte Carlo simulations. 



5. General Results 
5.1. Tdust 

Tdust across Perseus ranges from ~16 K to ~22 K with a median of 17 K. This is in agreement with 
SFD98 who found Tdust ^ 17 K for the same region and SOS who found a slightly higher Tdust ^ 18 K for 
the main body of Perseus. Similarly, Stepnik et al. (2003) derived T^ust ^ 17 K for the outskirts of Taurus 
using the DIRBE data at 140 /im and 240 ^m. 

As shown in Figure [7] the lowest Tdust regions spatially correlate with the Ay and CO images, in 
agreement with the expectation for dense and cold clouds. The prominent minima in Tdust correspond to 
B5, BIE, Bl, L1448, L1451, and L1455. These dark regions contain a number of IR sources and Herbig- 
Haro objects that are considered to be associated with pre-main sequence stars (e.g.. Bally et al. 2008), but 
the luminosities of these objects are so low that their influence on the overall energy budget of the clouds 
would be negligible. Therefore, Tdust within these dark regions is determined by the external ISRF and 
dust grain properties. Low Tdust in B5, BIE, Bl, L1448, L1451, and L1455 compared to the surrounding 
regions suggests that the inside of these regions gas is dense enough to have sufficient shielding against the 
external ISRF, leading to efficient cooling. For example, B5 has limb-brightening in CO, indicating a cold 
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cloud interior surrounded by a photo-heated exterior (e.g., Beichman et al. 1988). 

Figure |7] also shows several localized regions with high T^ust • One of the prominent features is the 
warm dust ring at (RA,Dec) = (03h39"^30",-f 32°), already discussed in Section 4.3. The other high Tdust 
features include IC348 and NGC1333. To quantify how much hotter these star-forming regions are, we plot 
the histograms of Tdust in Figure [9j For the histograms, we use the data points in the rectangular boxes 
shown in Figure [lO] Each box has either a dark or a star-forming region and extends up to the edge of the 
image. The width of each box is set based on the COMPLETE ^•^CO(l-O) data to include the whole region 
of interest. While all regions have a similar median Tdust of 17 K, the star-forming regions have data points 
with higher Tdust- For example, for IC348 and NGC1333, 11% and 2% of data points have Tdust > 18 K, 
while for B5, BIE, and Bl, all data points have Tdust < 18 K. The data points with Tdust > 18 K are found 
in the central parts of IC348 and NGC1333, where B-type stars are located (blue crosses and red stars in 
Figure [7| , and this implies that the internal radiation field may play a significant role in heating dust grains 
inside IC348 and NGC1333. This possibility is discussed further in the next section. 



5.2. ISRF: External or Internal? 

We now investigate possible heating sources that can result in the measured Tdust • As we mentioned in 
Section 1, KMT09's model assumes a uniform external ISRF. Understanding the origin of ISRF is therefore 
important for comparison between observations and the model. There are two possible sources of ISRF that 
are responsible for the measured Tdust across Perseus; (1) the UV flux from the B-type stars in Perseus 
(local) and (2) the UV radiation field determined by the overall stellar distribution in the Galaxy (global). 

(1) We consider two B5 V-type stars in Perseus (red stars in Figure [7| as possible sources of the local 
ISRF. All other stars have spectral types later than Bf|^ We assume that the spectral energy distribution 
of a B-type star is approximately a black body curve (e.g., Spaans et al. 1994) and adopt Tcs = 15600 K for 
a B5 V-type star (Morton & Adams 1968). For A = 912-2066 A, two B5 V-type stars emit 2.12 x 10^^ gj.g 
cm~^ s~^. If a distance of 15 pc from each star, which corresponds to ~l/3 of the longitudinal size of our 
sky coverage, is assumed, the UV fiux from the two B5 V-type stars is reduced to 3.04 x 10~^ erg cm~^ s~^ 
(l/r^ geometrical dilution). For dust grains in thermal equilibrium, Tdust is determined by (Lequeux 2005): 

G 4.6 X 10-" (tt^) rlust erg cm'^ s-\ (5) 

where G is the UV fiux and a is the size of dust grains. An absorption efficiency Qa = 1, appropriate for 
large dust grains that dominate the FIR emission and have a size comparable to UV wavelengths, and a dust 
emissivity spectral index /3 = 2 are assumed for Equation (5). Therefore, dust grains with a radius a = 0.1 
/im in the UV radiation field of 3.04 x 10^^ erg cm^^ s^^ would have Tdust ^ 9.3 K. Note that we do not 
consider the decrement of UV fiux by dust attenuation. Therefore, if the B-type stars are the only sources 
of the ISRF, we would expect Tdust ^ 9.3 K in the outskirts of Perseus. 

(2) We consider the global Galactic ISRF of Porter & Strong (2005). Porter & Strong (2005) adopted 
the Galactic stellar and dust distribution models and calculated the radiation field from stars at A = 0.1-1000 
fim. The ISRF was found to be the most intense in the inner Galaxy, to decrease toward the Galactic poles. 



^Two B8 V-type stars in NGC1333 (blue crosses in Figure [Tjl contribute negligibly to the UV flux. We do not consider the 
B4 IV- V-type star (red triangle in Figure [7]l due to its uncertain stellar classification (e.g., Lesh 1969; Klochkova & Kopylov 
1985; Cernis 1990). Nevertheless, inclusion of this star does not change our result. 
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but to remain steady up to ^5 kpc from the Galactic plane. For i?gai (Galactocentric radius) = and z 
(Galactocentric height) = 0, the radiation field at 0.1 /xm is ~0.08 eV cm~^ and for i?ga,i = and 2; = 5 kpc, 
the radiation field at 0.1 [xm is ~0.01 eV cm~^ (see Figure 2 of Porter & Strong 2005). If we assume that 
this ~87.5% decrement of the UV radiation field at iJgai = from 2; = to 5 kpc would be applicable for 
-Rgai ^ 8 kpc, the UV radiation field at z ^ —100 pc (the location of Perseus; i?gai ~ 8 kpc and z ^ —100 
pc) would be between 2.7 x 10""^ (the UV radiation field in the solar neig hborhood at A = 912-2066 A: Gq) 
and 3.4 x 10"" erg cm~^ s~^ (12.5% of Go). Applying Equation (5), this corresponds to 14 K < Tdust < 
20 K. As our median T^ust ~ 17 K is indeed in this range, the UV ISRF determined by the overall stellar 
distribution in the Galaxy could be the dominant heating source for dust grains across Perseus. 

Therefore, we conclude that Perseus is embedded in the uniform Galactic ISRF that heats dust grains 
up to Tdust ~ 17 K, except for the central parts of IC348 and NGC1333 where the radiation from the B-type 
stars is dominant. 



6. The Hi— H2 Transition in Perseus: Theoretical Perspective 

KMT08 modeled II2 formation and photodissociation for a spherical cloud that is bathed in a uniform 
ISRF. They assumed that the H1-H2 transition is infinitely sharp and found that the location of the transition 
inside an atomic-molecular complex is solely determined by two dimensionless numbers: 

^ _ /dissC^dC-Ep 



TR = ncNMCTd-R- (7) 

Here /diss ~ 0.1 is the fraction of absorptions of LW photons that produce H2 dissociation rather than decay 
back to a bound state, Ud is the dust absorption cross section per H nucleus in the LW band, is the 
number density of LW photons, ncNM is the number density of CNM in the Hi shielding layer, Tl is the H2 
formation rate coefficient on dust grains, and R is the cloud radius, x is the ratio of the rate at which LW 
photons are absorbed by dust grains to the rate at which they are absorbed by H2. We can consider x as a 
dimensionless measure of the strength of the ISRF. For example, in strong radiation fields, H2 molecules are 
easily photodissociated and therefore the absorption of LW photons is dominated by dust grains (large x)- 
On the contrary, in weak radiation fields, H2 molecules survive and due to their large resonant cross section 
the absorption of LW photons is dominated by H2 molecules (small x)- tr is simply the dust optical depth 
that the cloud would have if its density is equal to ticnm- 

Using X and tr, KMT08 derived xh2 , the fraction of the cloud radius where the H1-H2 transition occurs: 

-,1/3 



4(tr + 0.2^-) 



(8) 



4rR - 0.7V' 

where 

, 2.5 + x 
2.5 + xe 
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Then, the dust optical depth through the Hi shielding layer, thi, can be determined by: 

THi = nCNMCd-RCl - XU2) 

_ trV^ (9) 
~ 4tr-0.7^' 

The Hi shielding surface density for H2 formation, Shi,s) is simply: 

Sh,,s = TH, , (10) 

where /in is the mass per H nucleus. 

To investigate what fraction of the gas in an atoniic-niolccular complex is molecular, KMT09 simplified 
the above equations as follows. First, as both ad and TZ are the measures of dust surface area, their ratio 
is nearly unity and drops out of Equation (6). Second, ncNM is determined from the condition of pressure 
balance between the cold neutral medium (CNM) and warm neutral medium (WNM) phases: 

ncNM = ?!'CNM»^min 

31G"o (11) 

- 0CNM ^ ^ 31^/0.365 • 

Here G'q is the incident UV radiation field and Z' is the metallicity. The primes indicate that quantities are 
normalized to the values in the solar neighborhood. There is a maximum temperature at which CNM can be 
in pressure balance with WNM and the corresponding density is the minimum CNM density, riynin (Wolfire 
et al. 2003). Typically, (pCNM ~ 1-10. As ticnm Gq oc -Eq, Gg and Eq drop out of Equation (6) and x 
becomes purely a function of Z' and ij^cnm: 

. = 2.3i±§l^. ,12) 

•PCNM 

To estimate tr, KMT09 started from the mean column density, Scompj of a spherical cloud that consists 
of a molecular core of number density rimoi and a Hi shielding layer of number density ticnm: 

4 

5^comp = ^MH«CNM-R [1 + {(t^mol " '^)Xu2] ■ (13) 

Here 0nioi is the ratio of nmoi to ncNM- Once Tc, the dust optical depth that the cloud would have if its Hi 
and H2 are uniformly mixed, is defined by: 

- = i(^)' 

tr can be expressed as a function of Tc, (/imoi) and X}i2'- 

3 / ^compf^d 



TR 



4 V m J [1 + ((/>moi - l)a:^ 

;:3 



1 (15) 



■ [1 + ((/"mol - 1)2;; 



H2J 
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As a consequence of these simplified functions, the H2 mass fraction, /h2, can be expressed with the 
following analytic function: 



/H2 



M 



4re 



4rc + 3(0,noi - 



where AfH2 is the H2 mass and M is the total cloud mass. Finally, the H2-to-Hi ratio, i?H2, can be expressed 
as: 

Rh2 = fm/ /hi 

= /h2/(1 - /h2) 



(17) 



4rc + 3(0,„oi - 1)V 



With ip being a function of Z' and 0cnm, and Tc being a function of the cloud gas column density, i?H2 is 
determined by the total gas column density, Z', (ficNM, and (jy^noi, and is independent of the strength of the 
ISRF. While the total gas column density and Z' are direct observables, 0cnm and 0nioi are not. KMT09 
adopted 0cnm = 3 and (f>moi = 10 as fiducial values. Note that the H2-to-Hi ratio can observationally be 
expressed as i?H2 = 5]h2/5]hi- 

Another important model prediction is that Ehi,s has a dependence on Z' and is almost independent 
of the strength of the ISRF. Based on Equation (9) and (10), thi slightly increases with Z' via tr and 
and (Td increases with Z' via tid = cr'^Z' . Therefore, Shi,s decreases with Z' to a power less than unity as 
Shi.s fx THi/cd. For solar metallicity, KMT09 predict Ehi.s ^ 10 Mq pc^'^. 



7. The Hi H2 Transition in Perseus: Observational Perspective 

From our derived A^(Hi) and 7V(H2) images, we calculate Shi and Eh2 by: 

iV(Hi) 



Ehi (M© pc-") 



1.25 X 1020 (cm- 



Sh2 (Mo pc-2) = ... ^!.fc,') _2r (18) 
6.25 X lO^'^ (cm ^) 

We find that Shi is relatively constant across Perseus with a range of 5-11 Mq pc"^^ while Sh2 has a wider 
range of 0-73 pc"^. This results in a small dynamic range of Ehi + Sh2, from 8 pc^^ to 30 Mq 
pc~2 across most of the Perseus cloud. 

We then derive i?H2 = Sh2/Shi, which allows us to directly test KMT09's predictions. The i?H2 image 
is presented in Figure [T0| We find that the derived i?H2 ranges from to 10, with a median of 0.1. This 
suggests that diffuse molecular gas is pervasive in Perseus. For example, the data points with i?H2 > Ij 
which can be considered to be molecular-dominated, comprise only 9% of the total data points. We note 
that spatial variations in the J?h2 distribution are mainly determined by the Sh2 distribution, due to the 
small dynamic range of Ehi. The high i?H2 features spatially correlate with the Ay and CO images and 
correspond to the dark and star-forming regions in Perseus. 
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7.1. Shi vs Shi + 5]h2 for the Dark and Star-forming Regions 

To investigate KMT09's predictions, we focus on several dark and star-forming regions in Perseus and 
measure _Rh2 radially to probe the cloud envelopes with the lowest i?H2- In Figure |11| we plot Ehi as 
a function of Shi + Sh2 for B5, IC348, BIE, Bl, and NGC1333. For each plot we use the data points 
in the rectangular boxes shown in Figure [TO] The overlaid curves are KMT09's predictions for particular 
combinations of input parameters, Z', 0cnm, and i^moi- These parameters will be determined and discussed 
in Section 7.2.1. 

We find an almost constant Ehi ^ 6-8 M0 pc~^ for each region. This is consistent with the small range 
of Shi found across the whole Perseus cloud. However, the median value of Shi slightly changes between 
the regions. For example, B5 has the lowest median Shi ^ 5.8 Mq pc~^ and NGC1333 has the highest 
median Ehi ^ 8.2 M0 pc^^. A similar result for Perseus was pointed out by Sancisi et al. (1974) who 
found that Sfji remains approximately constant at ^8 pc~^ for a wide range of interstellar extinction. 
Several extragalactic studies have also noticed that Shi does not exceed ^10 Mq pc^^ for nearby galaxies 
(e.g., Wong & Blitz 2002; Bhtz & Rosolowsky 2004, 2006; Bigiel et al. 2008; Wong et al. 2009). 

In KMT09's model, H2 formation requires a certain amount of Ehi.s to shield H2 against photodis- 
sociation. Once this minimum Ehi is achieved, H2 forms out of Hi and Shi remains constant. KMT09's 
model predicts Shi,s ^ 10 Mq pc^^ for solar metallicity and this is indeed consistent with what we find in 



Figure 11 For all five regions, we find saturation of Shi ~ 6-8 M0 pc~^. However, in the case of B5 and 
NGC1333, we do not probe the regions of purely atomic gas, which correspond to the linear portion of the 
model curves in Figure [TT] For IC348, BIE, and Bl, we see hints of a possible turnover between atomic- 
and molecular-dominated zones at Eri + Sh2 = 5-8 Mq pc~^, yet our observations do not sample well the 
purely atomic zones. This inability to probe purely atomic regions is not due to the sensitivity of our Shi 
and Eh2 images. In Figure |11| we show the median 3a values of Ehi and Ehi + Eh2 for the whole Perseus 
cloud as the black dashed lines. All data points are clearly above the 3a values and this suggests that the 
lack of turnover detection is due to the highly extended atomic envelopes. For example, in the case of B5 
and NGC1333, we are probing radial profiles up to ~15-20 pc from the centers of the regions. Therefore, 
the purely atomic envelopes are likely located more than ~20 pc from the molecular peaks. 



7.1.1. Effect of High Optical Depth 

An alternative or additional explanation for the relatively constant Ehi we measured could be a high 
optical depth of Hi in Perseus. If Hi gas is optically thick, the measured brightness temperature Tb levels 
off at a temperature close to the kinetic temperature Tk. Therefore, under the assumption of optically 
thin Hi gas (which we used to calculate our Ehi), Ehi would saturate as Ehi oc Tb ^ Tk and therefore be 
underestimated. There are a few lines of evidence that indicate the presence of cold, optically thick CNM in 
Perseus. For example, HISA features were found toward the centers of several dark and star-forming regions 
(e.g.. Ridge et al. 2006a). In addition, Heiles & Troland (2003b) analyzed the Hi emission/absorption spectra 
obtained toward two radio sources behind Perseus, 3C93.1 and NRAO140, and found CNM components in 
Perseus (based on radial velocity information) that have optical depths r ~ 1 and ^7. 

The best way to estimate the effect of optical depth is to measure CNM in absorption in the direction of 
many background radio sources behind Perseus. This provides the total column density of atomic hydrogen 
gas (CNM -I- WNM) along each line of sight, which can be compared with the column density we measured 
in Section 3.2 to derive the optical depth corrections for the whole cloud. As only two such measurements 
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exist so far, we are in the process of obtaining additional Hi absorption measurements. Here we investigate 
only the magnitude of the optical depth effect. 

If we assume Tg = 70 K as measured by Heiles & Troland (2003b), the corrected A^(Hi) will need to be 
calculated using the following equation: 

Af(Hl) (cm-2) = 1.823 x 10^^ J In (jT^-^^ dv. (19) 

If we apply this equation to our iV(Hl) image, we find that Ehi changes by a factor of 1.2-2. Another 
approach involves using the statistics of CNM and WNM from Heiles & Troland (2003b). Their Table 1 
summarizes the CNM and WNM column densities for all sources in their survey. We can then estimate 
the fraction of CNM column density (relative to the total column density) as a function of WNM column 
density. Using only the table entries for a region centered on Perseus and ^1200 deg^ in size, we find that 
the CNM fraction is close to 50% for WNM column density less than 1.2 x 10^^ cm~^. This again suggests 
that our A^(Hi) or Ehi could be underestimated by a factor of two. 

Our excercise shows that a high optical depth along each line of sight could result in a factor of two 
spread in Ehi for Perseus. This is not substantial and therefore the almost constant Ehi we measured is 
likely a real physical feature, possibly driven by the conversion of Hi into H2. However, the magnitude of 
the optical depth effect we estimated is approximate and measuring the true Ehi distribution across Perseus 
needs a more sophisticated investigation. 



7.2. -Rh2 vs Shi + Sh2 for the Dark and Star-forming Regions 

We now investigate KMT09's prediction of i?H2 vs Shi + Sh2 for the dark and star-forming regions in 



Perseus. In Figure 12 we plot i?H2 as a function of Shi + Sh2 for B5, IC348, BIE, Bl, and NGC1333. For 



each plot we use the data points in the rectangular boxes shown in Figure |10[ 

We find that the relation between i?H2 and Shi + Sh2 is remarkably consistent for all dark and star- 
forming regions. Each plot in Figure [12] shows a very sharp rise of i?H2 at low Shi + Sh2! a turnover around 
i?H2 ^ 1, and a slow increase toward higher i?H2- For the two star-forming regions, i?H2 reaches a value of 
^10, while for the three dark regions, i?H2 is less than 5. 



7.2.1. Best-fit Curves 

In Figure |12[ we overlay KMT09's predictions as red solid curves and find excellent agreement. The 
overlaid curves were derived using Equation (17) with the assumption of (pnmX = 10 and Z' = 1 for all dark 
and star-forming regions. The fiducial value (p^oi = 10 was adopted following the discussion in KMT09. We 
note that ^moi does not make a significant change in -Rh2- For example, between (^moi = 10 and 50, i?H2 at 
Shi + 5]h2 = 100 M0 pc~^ only varies by a factor of 1.1. Our assumption of a single Z' — 1 is based on the 
result of Gonzalez Hernandez et al. (2009) and our DGR investigation in Section 4.5. Gonzalez Hernandez et 
al. (2009) studied the chemical composition of Cernis 52, an A3 V-type star in the direction of Perseus and 
derived a metallicity of [Fe/H] = —0.01 ± 0.15 (corresponding to 0.7-1.4 Zq). They determined a distance 
to Cernis 52 of 23llg5^ pc and in conjuction with the radial velocity information concluded that Cernis 52 is 
likely a member of IC348. As a single DGR fits most of the diffuse regions in Perseus, there is probably no 
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significant variation of Z' across the cloud. Therefore, we adopt Z' = \ for all dark and star-forming regions 
in Perseus throughout this paper and constrain only (/)cnm from our fitting 

We performed Monte Carlo simulations to determine the best-fit curves with consideration of uncertain- 
ties in both i?H2 and Shi + Sh2- We added random offsets to i?H2 and Shi + Sh2 and repeated this process 
1000 times. For each variable, offsets were drawn from a Gaussian distribution with the standard deviation 
equal to the measured noise. For each realization, we determined the best-fit curve by assuming (/)moi = 10 
and Z' — \ and finding (/)cnm that minimizes the sum of the squares of the residuals. IDL routine MPFITFUN 
(Markwardt 2009) was used for the fitting. Finally, we estimate the median ^cnm among the simulated 1000 
^CNM and use it as the best-fit parameter. We summarize this best-fit (/)cnm for each region in Table 1. 



1.2.2. Fitting Results 

We find that (f>cNM ranges from ~6 to '^lO. As (pcNM determines the typical CNM number density via 
'^CNM = 0CNM'T-min, ^CNM = 6-10 translates into Tcnm = 60-75 K under the assumption of thermal pressure 
allowing the coexistence of CNM and WNM (see Equation 19 of KMT09). This temperature range is indeed 
consistent with typical CNM properties in the solar neighborhood. Heiles & Troland (2003b) examined Hi 
emission/absorption spectra obtained toward 79 radio sources and found a median Tg of ^70 K for CNM. 

While 0CNM appears to systematically decrease toward the west of Perseus, we find no significant 
difference in 0cnm between the dark and star- forming regions. This implies that Hi envelopes surrounding 
these regions have similar ncNM- We estimate ncNM = 20-30 cm~^ using G'q = G/Gq = 0.4 derived from 
Equation (5) with Tdust = 17 K. The similar ncNM further suggests a similar normalized radiation field x 
of 1-1.5. As X measures the relative importance of dust shielding and H2 self-shielding, x ^ 1 implies that 
dust shielding and H2 self-shielding are equally important for H2 formation in Perseus. 



7.2.3. Column Density at The H1-H2 Transition 

KMT09 assume an infinitely sharp H1-H2 transition and in this case i?H2 would be zero at the Hi 
shielding column density. However, in reality, the transition cannot be infinitely sharp and KMT09 estimate 
i?H2 ~ 0.25 at the Hi shielding column density by comparing the theoretical Hi shielding column densities 
with the detailed numerical radiative transfer calculations. For solar metallicity, KMT09 predict (as shown 
with the red solid curves) that i?H2 ^ 0.25 occurs at Ehi + Sh2 = 7-10 M© pc~^. This is indeed consistent 



with what we see in Figure [T2| i?H2 ^ 0.25 at Sri + Sh2 = 6-12 Mg pc"^ or 7V(Hi) + 27V(H2) = (8-14) 
X 10^° cm^^. This transition column density is a factor of 2-3 larger than what previous studies found 
in the Galaxy via UV absorption measurements (e.g.. Savage et al. 1977; Gillmon et al. 2006). However, 
considering that the previous studies defined the transition column density at i?H2 ^0.1 and Sh2 linearly 
increases with Shi + Sh2, our result is consistent with the previous studies. 



We have attempted to determine the best-fit curve for -Rh2 vs Ejji -I- Sh2 by using both Z and </<cnm as free parameters 
but have found that the results are consistent with the case of Z' = 1 and varying ^CNM within Icr. 
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7.3. Ru2 Radial Profiles 

To investigate how i?H2 radially changes from a molecular peak, we plot i?H2 as a function of distance 
from the center of each dark and star-forming region in Figure |13[ We use the data points in the rectangular 



boxes (see Figure 10 1 and calculate the mean i?H2 in three-pixel size bins (bin size = 12.9', corresponding 



to 1.1 pc at the distance of 300 pc) to remove small-scale fluctuations. 

For all five regions, we find two power-law functions (i?H2 oc r^") in their radial profiles up to r ~ 150'. 
Beyond r ^ 150', the radial profiles show slight fluctuations due to the presence of small-scale structure 
in the i?H2 image and/or become flat. We perform a linear least-squares flt to each radial proflle up to 
r ^ 150' and summarize the results in Table 2 with other properties of the radial proflles. We list the slopes 
of power-law functions (ai and a2), the radius where the transition from the shallow to steep power-law 
functions occurs (rb), i?H2 at (i?b,H2), the radius where the H1-H2 transition occurs (rt), the radius where 
hydrogen is predominantly in the form of Hi (rni), and the thickness of the H1-H2 transition region (c?t). 
We define rt as a radius at i?H2 ^ 0.25, rpji as a radius at i?H2 ~ 0.1, and dt as rt ~ r^. 

We find no significant difference between the dark and star-forming regions in terms of their i?H2 radial 
distribution. The shallow portion of the radial profiles has a slope of a2 = 0.4-0.7, while the steep portion 
has a slope of ai = 1.4-3.7. The transition from the shallow to the steep power-law function occurs at rb ~ 
30-90' or ^3-8 pc. At this rb, i?H2 ranges from 0.5 to 1.2. In addition, we find that H2 extends up to r ^ 
200' or ~17 pc from the centers of the dark and star-forming regions. This suggests that molecular clouds 
are much more extended than what is suggested by molecular tracers such as CO. The H1-H2 transition, 
however, seems to be relatively sharp. We estimate dt = rt — rb, which measures the size of diffuse H2 
layer and therefore could be considered as the thickness of the transition region, and find dt ~ 30-60', 
corresponding to ^3-5 pc. We then estimate the ratio of the thickness of the H1-H2 transition region to the 
region size, dt/R < dt/»'Hi = 0.2-0.4. As we do not know the exact size of the dark and star-forming regions, 
dt/i'm provides only an upper limit on dt/R. 

The other interesting thing to note is that the i?H2 radial distribution becomes smoother toward the 
west of Perseus. Namely, Aa — \ai — a2\ becomes smaller from B5 to NGC1333. This may be related 
to ncNM decreasing from B5 to NGC1333 (inferred from 0cnm decreasing from B5 to NGC1333 with the 
same nmin! see Section 7.2.1), even though it is not clear how tt-cnm affects the smoothness of i?H2 radial 
distribution. In KMT09's model, ncNM determines the location of the II1-II2 transition and due to the 
assumption of the infinitely sharp transition, there is no such relation between ticnm and the smoothness 
of the i?H2 radial distribution. This systematic variation of Aa across Perseus could be resulted from the 
change of internal density structure and/or the complicated viewing geometry. 



8. Comparison of H2 and CO Distributions 

In Figure [14] we compare the H2 and CO distributions. The black and gray contours show the CfA 
Ico data and our I]h2 data, respectively. To ease the comparison, we plot only 3(T contours for both data 
sets. Figure [14] shows that H2 is found wherever significant CO exists. However, the reverse is not true; a 
considerable amount of H2 exists beyond the 3(j contour of Icq . While H2 is generally more extended than 
CO, there are significant spatial variations. For example, CO and H2 trace each other remarkably well in 
the southwest part of Perseus, while H2 is significantly more extended elsewhere (particularly around B5, 
BIE, Bl, IC348, and NGC1333). 
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8.1. Relative Distribution of H2 and CO 

To quantify the relative distribution of H2 and CO, we plot I]h2 and Iqo SiS a function of distance from 
the center of each dark and star-forming region in Figure |15| We first smooth the I]h2 image to the angular 
resolution of the CfA Ico image and use the data points in the rectangular boxes shown in Figure [TO] We 
then calculate the mean Eh2 and Ico in two-pixel size bins (bin size = 16.8', corresponding to 1.5 pc at the 



distance of 300 pc). Finally, each radial profile is normalized by its peak value and therefore Figure 15 shows 
the relative distribution of H2 and CO. As spatial averaging improves SNR, the radial profiles of I]h2 and 
Ico allow us to investigate the relative distribution of H2 and CO more reliably than the visual comparison 
of 3(T contours. 

We find that all five regions have extended H2 compared to CO at all radii. For IC348, BIE, and 
NGC1333, H2 radial profiles show slight fluctuations and/or become flat at r > 150', which has been seen 



in Figure 13 We measure the radius where each H2 and CO radial profile drops off to 1/4 of its peak value 
(rH2 and rco) and calculate the ratio of rH2 to rco- We find that the ratio, 7'h2/''cOi ranges from 1.2 to 2, 
with a median of 1.4. This suggests that on average H2 is ^-^1.4 times more extended than CO in Perseus, 
implying the existence of a substantial amount of "CO-dark" gas. 



8.2. Comparison to Wolfire et al. (2010): "CO-dark" Gas 

The "CO-dark" gas refers to interstellar gas that exists in the form of H2 along with Ci and Cli, but 
little or no CO. Such gas has been predicted from theoretical models of PDRs, which showed that the Cil- 
CO transition occurs deeper into an interstellar cloud than the H1-H2 transition due to the more effective 
self-shielding for H2 than CO (e.g., van Dishoeck & Black 1988; Tielens & HoUenbach 1985). 

Recently, Wolfire et al. (2010; WHMIO hereafter) modeled the "CO-dark" gas and explored how the 
fraction of H2 gas mass in this "CO-dark" component depends on the mass of a cloud (Af ), the UV radiation 
field (Gq) , the metallicity {Z') , and the mean l/-band extinction (Ay). They used a PDR code to calculate the 
H2, Ci, and Cii abundances in a spherical cloud and found that the fraction of "CO-dark" gas, /dg, is ~^0.3 for 
solar metallicity over a wide range of M and Gq. Here /dg is defined as /dg = (-^(^h2)~-^^(^co))/-^-^(''h2)i 
where and r^Q is the radius of the H2 and CO part of the cloud, respectively. Formally, is defined 
as the radius where 2nH2 = nn, and r'^Q is defined as the radius where the optical depth from the cloud 
surface to r^Q is 1. More simply, when expressed in terms of AAyjjQ = Av'(rQQ) — Ay(r^2), /dg depends 
only on the ratio AAv^Bc/Ay. This is because AAy.uc is a measure of the "CO-dark" gas mass and Ay 
is a measure of the H2 gas mass. Interestingly, WHMIO found that AAy^uG is a weak function of Gg and 
is almost constant. Therefore, /dg is strongly dependent only on Ay. For example, for a cloud with CO 
mass of 10^ M0 and solar metallicity, /dg ~ 0-8 is expected at Ay — 2 mag, while /dg ^ 0.2 is expected at 
Ay ~ 20 mag. 

To compare our observations to WHMlO's prediction, we calculate the H2 mass within the 3(T boundary 
of Icq (-M(r3cr,co)) and compare it with the H2 mass within the 3a boundary of Eh2 (-M(r3cr,H2))- We find 
/dg = {M{r3cr,H2) — M{r3a,co))/M{r3a,H2) ~ 0.3. Note that our definition of /dg is different from that of 
WHMIO. This is because r^Q cannot be directly measured from observations. Nevertheless, M(r^2) and 
M(r^Q) of WHMIO represent the total H2 and CO masses and therefore our /dg is conceptually similar 
with what WHMIO used. We estimate Ay ^ 1.5 mag for the data points within r3o-,H2- As WHMIO do not 
provide a prediction of /dg at Ay < 2 mag, we extrapolate it from their Figure 11; /dg ^ 0.8 is expected 
at Ay ^ 1.5 mag. Note that the expected /dg ^ 0.8 at Ay ^ 1.5 mag is for a cloud that has a much higher 
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mass (M ^ 10^ M©) and a stronger radiation field {G'q ^ 10) than Perseus. Considering the predicted 
insensitivity of /dg to the cloud mass and the strength of radiation field, however, we measure a factor of 
three lower value, /dg ~ 0.3. 

To investigate this discrepancy between our /dg and WHMlO's prediction, we first examine the depen- 
dence of /dg on the DGR. As shown in Section 4.5, our estimate of I]h2 and consequently /dg depends on 
the DGR via Equation (4). In the same section, we discussed the lower and upper limits on the DGR. If we 
use the lower limit on the DGR instead of our adopted DGR, Ay decreases to 1.1 mag and /dg increases to 
0.6. Our estimate is a factor of two lower than WHMlO's prediction at Ay ^ 1.1 mag, /dg ^ 0.9. Similarly, 
if we use the upper limit on the DGR, Ay increases to 1.9 mag and /dg decreases to 0.2. Our estimate is 
now a factor of four lower than WHMlO's prediction at Ay ^ 1.9 mag, /dg ~ 0.8. Note that "CO-dark" gas 
exists in Perseus even for these extreme cases and the discrepancy with the model persists. 

Second, we test if /dg is a strong function of Ay as WHMIO predicted. For this purpose, we divide 



the S]h2 image into four regions (see Figure 14 1 and estimate /dg and Ay for each region. We use the I]h2 
image derived from our adopted DGR, 1.1 x 10^^^ mag cm^. In Table 3, we list the estimated /dg and 
Ay for the four regions. We find /dg ^ 0.3 for regions 1 and 2 and /dg = 0.1-0.2 for regions 3 and 4. All 
regions have similar mean extinction. Ay = 1.4-1.7 mag. While we have only four data points covering a 
small range of Ay, we measure a relatively large range of /dGj from '^O.l to ^^0.3. The large variation we 
find in /dg is not consistent with WHMIO, but echos the observations of other diffuse clouds in the Galaxy. 
For example, the Polaris cloud has Ay ~ 0.3 mag (Heithausen & Thaddeus 1990) and /dg ^ 0.36 (Abdo et 
al. 2010). On the other hand, the Pegasus cloud has Ay ^ 0.6 mag (Yamamoto et al. 2006) and /dg ^ 0.6 
(Grenier et al. 2005). 

This region-to-region variation we find in /dg could be due to the DGR systematically changing across 
Perseus. For example, a factor of two lower DGR would allow regions 3 and 4 to have similar /dg as regions 
1 and 2. However, we do not find any evidence for such systematic change of the DGR when examining the 
correlation between 7V(Hl) and FIR emission (see Section 4.2). As an alternative, the complicated viewing 
geometry of the H2 and CO distributions could result in the regional variation in /dg- 



9. Discussion 

The excellent agreement between our data and KMT09's predictions (Sections 7.1 and 7.2) suggests that 
KMT09's model captures the fundamental physics of H2 formation reasonably well even on sub-pc scales. 
This is certainly encouraging, but slightly surprising considering how simple KMT09's model is. Essentially, 
for a spherical cloud bathed in a uniform and isotropic ISRF, the most important factor for H2 formation is 
the Hi shielding surface density Shi.s- Once enough shielding against photodissociation is achieved by Shi,s, 
all additional Ehi is fully converted into Eh2, resulting in a uniform Ehi distribution and therefore a linear 
increase of Sh2 with Shi + 5]h2 • 

Before going into a more detailed discussion of several important aspects of KMT09's model, we would 
like to remind the reader that two main predictions summarized in Section 6 are not independent. The 
predicted behavior of i?H2 as a function of Ehi + Sh2 is a consequence of saturation of Ejji. Therefore, 
the excellent agreement between our data and KMT09's predictions in Figure 12 is entirely driven by the 
constant Eri • While we are not able to probe the purely atomic envelope of Perseus, the constant Ehi of 
6-8 M0 pc~^ we find is in agreement with KMT09's prediction of Shi.s for solar metallicity. 
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In addition, we caution that our data do not allow us to fully test the dependence of i?H2 on nietallicity 
and ISRF. KMT09's model predicts that Rh2 depends on nietallicity, while being almost insensitive to the 
strength of ISRF. While our findings are in agreement with KMTOQ's predictions for solar metallicity and 
~2/5 of the Galactic ISRF, the lack of metallicity and ISRF variations across Perseus does not permit more 
detailed model testing. Future studies of high- latitude molecular clouds and GMCs in external galaxies with 
sub-solar metallicity will provide crucial tests of KMTOQ's model. We now turn to several crucial model 
assumptions and discuss their possible limitations. 

9.1. Equilibrium vs Non-equilibrium H2 Formation 

The most fundamental principle underlying KMT09's model is H2 formation in equilibrium. In equilib- 
rium, H2 formation and photodissociation are balanced and H2 abundance is locally constant. Several authors 
have investigated the time evolution of II2 and found a typical timescale of H2 formation in equilibrium, th2 
= 10-30 Myr (e.g.. Goldsmith et al. 2007; Liszt 2007) 

As Tu2 is comparable to or longer than the estimated lifetime of GMCs, rufe ~ 10 Myr (McKee & 
Ostriker 2007 and references therein), the role of interstellar turbulence in accelerating H2 formation has 
received significant attention in recent years (e.g., Hartmann et al. 2001; Bergin et al. 2004; Glover & Mac 
Low 2007; Glover et al. 2010). The main idea behind the models of H2 formation in turbulent media is 
that shocks induced by cloud-cloud collisions, spiral density waves, supernovae explosions, etc., compress 
diffuse atomic gas, allowing H2 to form in the postshock gas where high enough shielding is achieved against 
photodissociation. While earlier studies have been limited to ID, Glover & Mac Low (2007) and Glover et 
al. (2010) performed 3D magnetohydrodynamical (MHD) simulations of turbulence with detailed chemical 
modeling. They showed that a significant amount of II2 could form within 1-2 Myr, due to the turbulence- 
driven clumping that accelerates H2 formation. However, the H2 abundance keeps increasing over time and 
never reaches equilibrium by 20-30 Myr. Eventually, the gas will become fully molecular unless the H2 
formation is terminated by destruction of GMCs via stellar feedback, e.g., photoevaporation by Hii regions 
and protostellar outflows. This sharply contrasts with KMTOQ's model. 

Our results are consistent with the model of H2 formation in equilibrium and suggest that turbulence 

may play a secondary role in the H2 formation. Clearly, further studies are needed to compare abundances 
of H2 and other molecular species with the predictions from both stationary and turbulent models. 

9.2. The Importance of WNM in the Envelopes of GMCs 

We flnd that Shi is uniform across Perseus with a range of 6-8 Mq pc~^, consistent with KMTOQ's 
prediction of Shi.s for solar metallicity. However, as CNM is far more effective at shielding against pho- 
todissociation than WNM, the predicted Shi.s comes purely from CNM, while our measured Shi includes 
the contributions from both CNM and WNM. As we discussed in Section 7.1.1, our measured Ehi could be 
underestimated by a factor of two for high optical depth gas (CNM). While Shi corrected for high optical 
depth would likely still be comparable to KMTOQ's prediction, the importance of WNM in the envelopes of 
atomic-molecular complexes clearly needs to be taken into account in the models of H2 formation. This is 
also supported by Bolatto et al. (2011) who found a relatively good agreement between KMTOQ's prediction 
and the observed i?H2 vs Shi + Sh2 in the Small Magellanic Cloud, where WNM dominates the total Hi 
(Dickey et al. 2000). At the same time, it is essential to observationally constrain the relative contribution 
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from CNM and WNM to Ehi- We are in the process of obtaining Hi absorption measurements for Perseus, 
which will allow us to separate the contributions from cold and warm gas. 

9.3. The Importance of Internal Radiation 

As we showed in Section 5.2, in the case of IC348 and NGC1333, the embedded B-type stars are the 
dominant sources of the radiation field. KMT09's model does not take into account the role of internal 
radiation in determining i?H2, with the assumption that massive stars producing significant amounts of 
dissociating photons do not stay inside their parent molecular clouds for a long time. While we clearly see 
the importance of the B-type stars for dust heating, we find no significant difference in our analysis of i?H2 
for the dark and star-forming regions. This supports the idea that the internal radiation is not important 
for H2 formation. However, as we show below, this is valid for Perseus because it does not contain any stars 
earlier than the B5 V-type. 

When O- or B-type stars suddenly turn on in a molecular cloud, UV photons from the stars photodisso- 
ciate H2 and create Hi dissociation regions around Hll regions. This variation of the UV radiation field has 
a much shorter timescale (^10'^ yr) than H2 formation (^10^ yr) and therefore totally changes the thermal 
and chemical structures of the molecular cloud (e.g., HoUenbach & Natta 1995; Bertoldi & Draine 1996). 
However, the sphere of photodissociation influence depends greatly on the stellar type. 

Roger & Dewdney (1992) studied the properties of Hi dissociation regions around O- and B-type main 
sequence stars. Their source list includes SVS3, the B5 V-type star in NGC1333 (red star in Figure |7|. An 
Hi region associated with SVS3 was measured to have A/hi = 0.017 M0 and i?Hi = 0.03 pc (Rodriguez et 
al. 1990). This is consistent with an Hi dissociation region created by a star of T^tf ^ 16600 K in a dense 
medium of riui ~ 6000 cm^'^. The size of the Hi dissociation region around this particular B5 star is only 
^1/10 of our spatial resolution. Therefore, the Hi dissociation regions around the individual B-type stars in 
Perseus are below our spatial resolution and therefore are negligible for our consideration of i?H2- 

However, the internal radiation is still important for dust heating inside IC348 and NGC1333. This can 
be understood by the fact that not every UV photon produces photodissociation of H2. While UV and visible 
photons heat dust grains, only the UV photons in the narrower LW band (912-1120 A) photodissociate H2. 
In addition, only ^10% of the LW photons actually photodissociate H2 (/diss ^ 0.1). Therefore, the UV 
photons from the B-type stars in IC348 and NGC1333 could be dominant for dust heating, while relatively 
ineffective for photodissociation of H2 . 

If any stars earlier than B3 V resided in Perseus, on the other hand, the role of the internal radiation 
in determining i?H2 would be no longer negligible. For example, Roger & Dewdney (1992) found that an 
Hi region associated with IRAS 23545-1-6508 has Mhi = 1.4 Mq and i?Hi = 0.4 pc and these properties are 
consistent with an Hi dissociation region created by a B3 V type star in a medium of tt-hi ^ 100 cm^'^. The 
size of the Hi dissociation region is now comparable to our spatial resolution and therefore i?H2 around the 
stars earlier than B3 V would be significantly modified. Therefore, depending on the stellar type of stars 
residing inside GMCs, the internal radiation should not be ignored. 
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10. Summary 

In this paper, we study the H1-H2 transition across the Perseus molecular cloud on sub-pc scales (~0.4 

pc). Wc use the highest resohition Hi data for Perseus from the GALFA-Hl survey, which allow us to probe 
the outskirts as well as the main body of the cloTid. The velocity range of the Hi emission associated with 
Perseus is determined based on the correlation between Af(Hi) and the dust column density traced by the 
2MASS Ay image. We then derive the A''(Hi) image under the assumption of optically thin Hi gas. The dust 
column density is estimated using the IRIS 60 /um and 100 yum images and is calibrated with the 2MASS 
Ay image. In combination with the A^(Hl) image and the derived DGR, we use the dust column density to 
derive the iV(H2) image. 

The iV(Hi) and N{B.2) images and the existing CO data for Perseus allow us to test two recent models 
of H2 and CO formation in equilibrium. For several dark and star-forming regions (B5, BIE, Bl, IC348, and 
NGC1333), we investigate Shi and i?H2 as a function of Shi + Sh2 and compare the results to KMTOQ's 
predictions. We also compare the H2 and CO distributions and estimate the fraction of H2 mass in the 
"CO-dark" component, /dg- We investigate /dg as a function of Ay and compare the result to WHMlO's 
prediction. Several of the main results can be summarized as follows: 

1. For both dark and star-forming regions in Perseus, we find an almost uniform Shi ^ 6-8 Mq pc~^. 
This is consistent with KMTOQ's prediction of the Hi shielding surface density for solar metallicity, Shi,s ^ 10 

M0 pC-2. 

2. The purely atomic envelopes are located more than ^20 pc from the centers of the dark and star- 
forming regions. We probe the Shi and Shi + Sh2 distributions up to ~20-30 pc from the centers of the 
regions and barely detect the turnover between atomic- and molecular-dominated zones for IC348, BIE, and 
Bl. 

3. The relation between i?H2 and Ehi + Sh2 on a log-linear scale is remarkably consistent for all dark 
and star- forming regions, having a very sharp rise of Ru2 at low Shi + Sh2, a turnover around Ru2 ~ 1, 

and a slow increase toward higher i?H2- 

4. The dark and star-forming regions have similar (/>cnm = 6-10, which implies that these regions are 
embedded in the atomic envelopes with ncNM = 20-30 cm~^. In addition, ^cnm = 6-10 indicates a similar 
normalized radiation field X 1) suggesting that dust shielding and H2 self-shielding are equally important 
for H2 formation. 

5. We measure the H1-H2 transition gas column density of 7V(Hl) + 27V(H2) = (8-14) x 10^" cm~^ or 
Shi + Sh2 = 6-12 M0 pc^^ at _Rh2 ^ 0.25. This is consistent with the previous estimates for the Galaxy. 

6. We estimate the fraction of "CO-dark" gas. /dg ^ 0.3, for Perseus, a factor of three lower than 
WHMlO's prediction. This implies that a significant amount of H2 gas is not traced by Galactic CO surveys. 

7. We investigate the dependence of /dg on Ay and find a relatively large variation of /dg ~ 0.1-0.3 
for similar Ay ~ 2 mag. This is not consistent with WHMlO's prediction that /dg is a strong function of 

Ay. 

While we generally find good agreement with KMTOQ's predictions, several outstanding questions re- 
main. Our estimated Shi ^ 6-8 Mq pc~^ is likely dominated by WNM, while KMTOQ's model predicts 
shielding by CNM alone. Clearly, the WNM in the atomic envelopes of GMCs needs to be considered in 
the models of H2 formation as a source of shielding against photodissociation. The importance of gravita- 
tionally bound vs diffuse gas is another important aspect, as explored in Ostriker et al. (2010). In addition. 
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KMT09's model docs not take into account the role of internal radiation for H2 formation. This is valid for 
Perseus as it does not contain early-type stars to photodissociate a significant amount of H2 that could be 
detected at our spatial resolution. However, if GMCs are young enough to harbor early-type stars that are 
able to change the thermal and chemical structures of the GMCs, the internal radiation cannot be ignored. 
Most importantly, our Perseus data on sub-pc scales are consistent with KMT09's model for H2 formation 
in equilibrium and suggest that turbulence may not be of primary importance for H2 formation. Yet, the 
timescale for equilibrium H2 formation is longer than the estimated lifetime of most GMCs. This problem 
remains to be resolved. Future detailed comparisons of observations with non-equilibrium, turbulence-driven 
H2 formation models will be important to investigate the role of turbulence for formation of H2 and various 
molecular species. 
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TABLE 1 

Fitting Results for i?H2 vs Shi + Sh2 



Region 


^GNM 


B5 


9.61 ± 1.32 


IC348 


8.37 ±0.88 


BIE 


8.32 ± 1.22 


Bl 


7.95 ± 1.06 


NGC1333 


6.26 ± 0.82 



" Uncertainties in (/>cnm are estimated from the distribution of simulated 1000 0cnm. 



TABLE 2 

Properties of Ru2 Radial Profiles 



Region 




(5^ 


al 


a2 




-^b,H2 


rt 


r'm 


4 




(J2000) 


(J2000) 






(') 




(') 


(') 


(') 


B5 


03 47 39 


+32 51 28 


3.69 + 0.15 


0.44 + 0.01 


~90 


~0.7 


-120 


-160 


-30 (< 0.19) 


IC348 


03 44 13 


+32 04 13 


3.22 ± 0.06 


0.57 + 0.01 


~70 


~1.2 


-120 


-150 


-50 (< 0.33) 


BIE 


03 36 45 


+31 12 40 


1.67+0.02 


0.42 + 0.01 


-30 


-1.2 


-90 


-150 


-60 (< 0.40) 


Bl 


03 33 18 


+31 08 23 


2.21 + 0.05 


0.70 + 0.01 


~60 


-0.8 


-110 


-170 


-50 (< 0.29) 


NGC1333 


03 29 17 


+31 21 16 


1.43 + 0.04 


0.61 + 0.01 


~60 


~0.5 


-100 


-190 


-40 (< 0.21) 



Central position of each region. For B5 and Bl, the centers correspond to the /i3co peaks as their Rn2 peaks are 
masked. For IC348, BIE, and NGC1333, the centers correspond to the Rh2 peaks. 
^ Radius where the transition from the steep to shallow power-law functions occurs. 
" Rh2 at rb. 

Radius where the H1-H2 transition occurs. This radius is defined at Rm — 0.25. 
" Radius where gas is mainly in Hi. This radius is defined at Rh2 — 0.1. 

^ Thickness of the Hi H2 transition region. This is measured as n — rb- The values in parantheses are dt/rm- 
^'"•'^'^ Estimated by eye. 

b,d,e,f j^/ corresponds to 0.09 pc at the distance of 300 pc. 



TABLE 3 

/dg and Av for Perseus 



Region 


/dg 


Av (mag)'* 


1 


0.32 


1.71 


2 


0.31 


1.73 


3 


0.18 


1.58 


4 


0.10 


1.43 



^ Icr uncertainty is less than 1% of each quantity. 
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Fig. 1. — 2MASS Av image of Perseus. The contours of CfA Iqo are overlaid in black. The contour levels 
range from 10% to 90% of the peak (69 K km s"^) with 10% steps. The angular resolution of the 2MASS 
Ay and CfA Iqo images is 5' and 8.4', respectively. Several prominent dark and star-forming regions are 
labeled. 
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Fig. 2. - Hi (thick) and CO (thin) spectra at the position of IC348. For comparison, CO is ampUfied twice. 
The main component of Hi peaks at u ~ +5 km s~^ and is offset by 2-3 km s~^ from the peak of CO. 
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Fig. 3. — Correlation coefficient between iV(Hl) and 2MASS Ay as a function of velocity width Av. Data 
points with SNR > 5 in the 2MASS Ay image are used to calculate the correlation coefficient. 
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Fig. 4. — A^(Hi) image at an angular resolution of 4.3'. Hi emission is integrated from v = —5 km s~^ to 
+15 km and A''(Hi) is derived under the assumption of optically thin Hi gas. 



- 32 - 




Fig. 5. — (Left) /go as a function of A^(Hl). (Right) /loo as a function of iV(Hl). The gray data points are 
assumed to mostly trace Perseus and the black data points are likely from the Taurus molecular cloud. See 
Section 4.2. 
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Fig. 6. — IRIS 100 /xm image with the black data points from Figure 5. 
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T<,„.. (K) 




Right Ascension (J2000) 

Fig. 7. — Tdust image derived from the IRIS 60 fim and 100 /im images after accounting for the contribution 
from VSGs. Several B-type stars as potential dust heating sources are marked; B8 V-type stars (BD+30°540 
and BD+30°549) are shown as blue crosses (Cernis 1990); B5 V-type stars (BD+31°643 and SVS3) are shown 
as red stars (Cernis 1990; Luhman et al. 2003); B4 IV- V-type star (BD-|-29°566) is shown as a red triangle 
(Cernis 1990). 
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Fig. 8. — Ay as a function of A^(Hi). The contour levels correspond to 50%, 30%, and 10% of the total 
data points. The black circles are median Ay values in 3 x 10^^ cm~^ wide bins. The linear least-squares 
fit for the median Ay values is shown as a black dashed line. 
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Fig. 10. — i?H2 image. The red rectangular boxes show the boundaries of each dark and star-forming region 
set by the COMPLETE 13CO(1-0) data. 
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— (Continued) (Left) Bl. (Right) NGC1333. 
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Fig. 12— (Continued) (Left) Bl. (Right) NGC1333. 



- 42 - 




'"3 / ™3 = "a 




<jj o 

§ 8 

CO +^ 



LO 

m 

'a 
o 



O 

a 

o 
o 
00 



CO 

i 

X2 



13 



0) 

,£3 



a 

cz> 
O 



13 



T3 
O 



o 



bjO 



o g 

CO ^ 
13 



o ^ 

■ I 



03 O 



03 <p 

H ^ 

CO n; 

o 



T3 

CD 

13 

0) m 



'"2 / "2 = '"a 



'"3 / - ™^ 



CO a 
^ > 



- 43 - 



" 1.0 




Rodius (orcmin) 



10 100 
Rodius (orcmin) 



Fig. 13— (Continued) (Left) Bl. (Right) NGC1333. 
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Right Ascension (J2000} 

Fig. 14. — Sh2 image. The contour for SNR of Eh2 = 3 is overlaid in gray. The contour for SNR of CfA 
^co = 3 is also overlaid in black. The angular resolution of the I]h2 and CfA Iqo images is 4.3' and 8.4', 
respectively. The Sh2 image is divided into four regions and each region has either a dark or a star-forming 
region. 
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